Optimal density of bacterial cells

A substantial fraction of the bacterial cytosol is occupied by catalysts and their substrates. While a higher volume density of catalysts and substrates might boost biochemical fluxes, the resulting molecular crowding can slow down diffusion, perturb the reactions’ Gibbs free energies, and reduce the catalytic efficiency of proteins. Due to these tradeoffs, dry mass density likely possesses an optimum that facilitates maximal cellular growth and that is interdependent on the cytosolic molecule size distribution. Here, we analyze the balanced growth of a model cell, accounting systematically for crowding effects on reaction kinetics. Its optimal cytosolic volume occupancy depends on the nutrient-dependent resource allocation into large ribosomal vs. small metabolic macromolecules, reflecting a tradeoff between the saturation of metabolic enzymes, favoring larger occupancies with higher encounter rates, and the inhibition of the ribosomes, favoring lower occupancies with unhindered diffusion of tRNAs. Our predictions across growth rates are quantitatively consistent with the experimentally observed reduction in volume occupancy on rich media compared to minimal media in E. coli. Strong deviations from optimal cytosolic occupancy only lead to minute reductions in growth rate, which are nevertheless evolutionarily relevant due to large bacterial population sizes. In sum, cytosolic density variation in bacterial cells appears to be consistent with an optimality principle of cellular efficiency.


Author summary
The cellular cytosol harbours diverse molecules, whose crowding slows down diffusion and perturbs the chemical equilibrium of biochemical reactions. Reaction rates thus depend not only on the reactants themselves, but also on the background density of other molecules; as a consequence, maximal cell growth requires an optimal density. Here, we simulate a model cell with crowding-adjusted metabolic reaction kinetics. Its cytosol accommodates two types of reactions: metabolic reactions, involving small molecules, and protein production reactions that involve much larger molecules. These two cellular subsystems have distinct optimal densities, and a shift in their relative contribution to the cellular biomass explains the observed 10% difference in cytosolic density between E. coli bacteria growing in nutrient-rich and -poor environments.

Introduction
The dry mass dissolved in the major compartment of bacterial cells, the cytosol, comprises hundreds of molecular species, including proteins, metabolites, polysaccharides, and nucleic acids. These molecules can be roughly classified into two sectors: the ribosomal sector, dominated by ribosomes and tRNA; and the non-ribosomal sector, comprising mostly metabolites, enzymes, and other proteins [1]. The molecules in these two sectors have very different size distributions: the ribosome is 65 times larger than the median enzyme (2,600 kDa [2] vs. 40 kDa [3]), and tRNAs are about 300 times larger than typical metabolites (26 kDa [4] vs. 89 Da, the mass of alanine). The allocation of dry mass between the two sectors of the cellular economy can be summarized by a single parameter, the growth rate μ, with the dry mass fraction of the ribosomal sector increasing almost linearly with μ [1,5]. Accordingly, the ribosome-rich cytosol at fast growth in nutrient-rich environments and the ribosome-meager cytosol at slow growth in nutritionally poor environments exhibit very different distributions of molecule sizes.
There are multiple approaches to measure the cytosolic dry mass density. Experiments based on centrifugation directly measured the buoyant density of cells, reporting that the E. coli cell density is approximately independent of growth rate across different nutritional conditions [6,7]. Experiments based on optical intensity measurements of cellular dry weight and cellular volume of E. coli also show an approximate proportionality between weight and volume, again suggesting a single dry mass density across conditions [8]. Recently, more accurate experiments that employed spatial light interference microscopy [9] confirmed an approximately constant dry mass density at 300g/L across minimal nutrient conditions with low growth rates. However, the same study found a roughly 10% less dense cytosol in a rich medium supporting high growth rates [10].
How does the cell control its density during cellular growth? Analyses of individual cells showed that both the cell width and the surface area / mass ratio of a cell are approximately constant throughout its growth trajectory and depend on the nutritional environment [10]. When the nutrients change, the cell changes its molecular machinery to adapt. These changes lead to a shift in turgor pressure in the cell and change its width. At the same time, the cell also actively shifts the coupling between cell-wall insertion and biomolecule synthesis, leading to a different surface / mass ratio. Essentially, these mechanisms allow the cell to keep a roughly constant density over generational time scales, despite short-term density variations along the growth trajectory. In the current analysis, we are not concerned with the regulatory details of this density homeostasis, but we want to explore how natural selection might influence the target density of the cellular regulation.
We hypothesized that the difference in cytosolic density between slow and fast growth observed by Oldewurtel et al. [10] may be an evolutionary consequence of the differences in molecular composition. Cellular physiology has evolved under natural selection; thus, if the level of molecular crowding (the dry mass density of the cytosol) affects cellular efficiency and hence fitness, we expect density regulation to have evolved to a near-optimal, possibly condition-dependent state. While previous work has explored the influence of (macro-)molecular crowding on bacterial physiology and growth rates, these analyses assumed a constant, hard limit on the total cytosolic protein [11,12] or dry mass [13,14] concentration. These works do not justify the existence and magnitude of the density limit based on physicochemistry, and they cannot explain differences in the level of crowding (dry mass density) across conditions.
Biochemical reaction fluxes typically increase with increasing encounter rates of the molecule species involved; ignoring crowding effects on diffusion, encounter rates increase with increasing density of the respective molecules. At the same time, the molecular crowding caused by other molecule species in the background (volume-excluding co-solutes) affects fluxes in at least three distinct ways. (i) Crowding slows down the diffusion of a catalyst and its substrates, thereby reducing their encounter rates [15,16]. (ii) Crowding limits the available volume and thus reduces a solute's entropy (the "excluded volume phenomenon"), thereby changing the free energies of the molecules involved in the reaction and consequently shifting the equilibrium concentrations of substrates and products [17]. (iii) Crowding can affect the structure of the protein catalyst, the process of its folding, and its conformational stability [18]; these structural changes may disturb the reaction flux if they affect the active site [18][19][20][21][22]. Due to these opposing effects, there may be an optimal cytosolic density where cellular efficiency and hence fitness are maximal.
The effect of crowding depends on the size of the catalyst and its substrate: in the presence of other volume-excluding cosolutes, the larger the size of a solute, the stronger the reduction of its diffusion [23] and the perturbation of its free energy [24,25]. Hence, when the size distribution of the molecules changes, the cell needs to adjust its cytosolic density in order to optimize its physiological efficiency.
In a pioneering theoretical study, Vazquez [26] considered how the objective flux of a metabolic network is affected by crowding, assuming that the enzymes are also the crowders in their own right. The corresponding model accounts for the slowdown of diffusion due to crowding, but ignores the growth-rate dependent role of the ribosomal sector and the corresponding changes in the molecular size distribution. This study found that there exists an optimal cytosolic density, which maximizes reaction fluxes and depends on details of the network, such as the ratio between diffusion-limited and transition-state-limited reactions. Using an alternative modeling approach, Dill et al. arrive at a similar conclusion [27].
While these studies demonstrated the existence of a flux-optimizing cytosolic density, natural selection maximizes fitness, not metabolic reaction fluxes. For non-interacting cells in a uniform environment, fitness is closely related to the growth rate [28]. The cellular physiology at maximal growth rate, and hence maximal fitness, can be described mathematically through growth balance analysis (GBA) [13]. This modeling framework simulates the balanced growth of a self-replicating bacterial cell while accounting for the major physicochemical constraints on cellular growth: mass balance of metabolism and protein production, non-linear reaction kinetics that depend on the concentrations of catalysts and their substrates, and the effects of molecular crowding. Vazquez [26] assumed that all crowding effects can be quantified by classifying reactions into two types: (1) those in the saturation regime, with [S] ≫ $K_M$, and (2) those in the diffusion-limited regime, with [S] ≪ $K_M$. This approach constrains the modeled substrate concentrations of each reaction to be either much smaller or much larger than the corresponding $K_M$, which is incompatible with a realistic modeling of intracellular metabolite concentrations and their effect on molecular crowding [29].
Standard GBA assumes a given, hard limit on dry mass density [13,14] or on total protein concentration [12]. Here, to assess the effects of molecular crowding on fitness, we apply a generalization of GBA that instead describes the kinetics of metabolic reactions and protein translation through crowding-adjusted Michaelis-Menten kinetics [25,30]. We maximize the balanced growth rate while varying the concentrations of transporters, catalytic proteins, and metabolites; these molecular species also form the volume excluding co-solutes in the background of each reaction, affecting diffusion, free energies, and hence reaction kinetics through molecular crowding. Consistent with experimental observations, we find that the cytosolic density of optimal growth strategies depends on the cellular growth rate.

Crowding-adjusted reaction kinetics
The effects of molecular crowding on biochemical reaction kinetics are due to volume exclusion effects. Thus, the relevant parameter for their quantification is not the total mass density (dry mass per volume) but the total volume occupancy ρ, i.e., the fraction of cytosolic volume occupied by dry mass. However, as molecular mass and volume are approximately proportional [31], we treat density and occupancy as interchangeable, subject to a scaling coefficient that quantifies the mass/volume ratio of the cytosolic dry mass. Depending on the external conditions, the volume occupancy of biopolymers in the cytosol of E. coli lies within a range 0.16-0.36 [32], providing a lower limit of the total volume occupancy.
To model biochemical reaction kinetics as a function of molecular crowding, we use a modified description of irreversible Michaelis-Menten kinetics [25,30],

$$v = k_{\mathrm{cat}}\,[E]\,\frac{[S]}{K_M^{*} + [S]}, \qquad (1)$$

where [S] and [E] are the concentrations of the substrate and catalyst, respectively; $k_{\mathrm{cat}}$ is the catalytic rate constant (turnover number); and $K_M^{*}$ is the crowding-adjusted Michaelis parameter [25,30]. In a typical catalytic reaction, a substrate molecule S encounters a catalyst molecule E to form a catalyst-substrate complex ES, which either proceeds forward to convert the substrate into the product, or reverts back to release the substrate. To derive $K_M^{*}$, we assume that a catalytic reaction can be divided into two subsequent, independent steps that both depend on the cytosolic volume occupancy: (1) S and E diffuse until their encounter (Fig 1B) and thereafter (2) they bind and unbind reversibly until the reaction proceeds forward and product P is released (Fig 1A). In step (2), we further assume that S and E will stay in close proximity and do not diffuse away from each other. These approximations simplify the model derivation and make the reaction times of the two steps additive [25]. Combining the rate laws of the two steps provides an estimate for the effective Michaelis parameter $K_M^{*}$ [25,30], given by Eq (2). Here, $K_M^{0}$ is the Michaelis parameter in the low-crowding limit; Γ is the correction term for the shift in Gibbs free energy ΔG due to molecular crowding, with Γ = 1 in the limit of low molecular crowding; the exponential term exp(−gρ) accounts for the slow-down of diffusion, where the scaling factor g can be estimated from the size of the substrate, i.e., g = g($r_S$) for a spherical substrate with radius $r_S$; and θ is the relative weight between the rate laws of steps (1) and (2), or in other words the relative ratio between time spent on step (1) and on step (2) at low cytosolic occupancy. Transition state limitation dominates when θ is large (θ → ∞; Fig 1A), which gives $K_M^{*} \approx K_M^{0}\,\Gamma$; diffusion limitation dominates when θ is small (θ → 0; Fig 1B), which gives $K_M^{*} \approx K_M^{0} / \exp(-g\rho)$. The best available estimate for θ is 2.3, obtained for the ERK MAP kinase phosphorylation reaction [30]; we employ this value in our simulations for metabolic and ribosomal reactions. Small metabolites, however, may diffuse more readily, and so metabolic reactions may have a stronger bias towards being transition state-dominated than ribosomal reactions. Therefore, we also employ an alternative model, in which the metabolic reaction has a θ twice as large as that of ribosomal reactions (θ = 4.6 for metabolic and θ = 2.3 for ribosomal reactions), and examine if this alternative assumption affects our conclusions.
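The two limiting cases of the crowding-adjusted Michaelis parameter can be illustrated with a short sketch. The interpolation used in `km_star` is an assumption chosen only so that it reproduces the stated limits (transition-state limited for large θ, diffusion limited for θ → 0, and the low-crowding value when Γ = 1 and ρ = 0); the published Eq (2) may have a different form. The values of `g` and `gamma` are hypothetical illustration values, and Γ is passed in as a plain number rather than computed from the occupancy.

```python
import math


def km_star(k0m, gamma, g, rho, theta):
    """Crowding-adjusted Michaelis parameter under an ASSUMED interpolation.

    theta weights the transition-state term (gamma) against the diffusion
    term 1/exp(-g*rho). The result equals k0m when gamma == 1 and rho == 0.
    """
    diffusion_term = 1.0 / math.exp(-g * rho)  # inverse of the diffusion slow-down
    return k0m * (theta * gamma + diffusion_term) / (theta + 1.0)


def mm_rate(enzyme, substrate, kcat, km):
    """Irreversible Michaelis-Menten rate v = kcat*[E]*[S] / (km + [S])."""
    return kcat * enzyme * substrate / (km + substrate)


if __name__ == "__main__":
    k0m = 130e-6   # mol/L, low-crowding Michaelis parameter used in the pathway model
    theta = 2.3    # estimate quoted in the text
    g = 5.0        # HYPOTHETICAL diffusion scaling factor
    for rho in (0.0, 0.1, 0.2, 0.3):
        gamma = 1.0 if rho == 0.0 else 0.8  # HYPOTHETICAL free-energy correction
        km = km_star(k0m, gamma, g, rho, theta)
        # Rate for 1 uM catalyst at a substrate concentration equal to k0m.
        print(rho, km, mm_rate(1e-6, k0m, 1.0, km))
```

Keeping Γ as an input avoids committing to a particular free-energy model, which the text above does not specify.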

Protein translation favors lower occupancy compared to metabolic pathways
To understand the effect of crowding on catalytic reactions, we first consider a simple, linear biochemical pathway model consisting of N = 20 consecutive enzyme-catalyzed reactions at steady state, i.e., with identical flux through each reaction (Fig 2A). The kinetics of each step in the pathway depend not only on the concentrations of the catalyzing enzyme and the substrate of the reaction, but also, through the crowding effects on $K_M^{*}$, on the concentrations of the molecules involved in the remaining N − 1 reactions. We identified the combination of enzyme and metabolite concentrations that maximizes the pathway output per dry mass, calculated from crowding-adjusted kinetics; adding these concentrations, weighted by the respective molecular volumes, resulted in an estimate of the optimal cytosolic occupancy $\rho_{\mathrm{opt}}$ for this system. To restrict the number of model parameters, we assumed identical enzyme and substrate sizes as well as identical crowding-adjusted kinetics (Eq (1)) for all reactions in the pathway. We fixed the physicochemical parameter $K_M^{0} = 130$ μM, which is the median value for metabolic enzymes and their substrates [33] and is also close to the value estimated for the binding of ternary complexes to the ribosome [34], 120 μM. The second physicochemical parameter in this model, $k_{\mathrm{cat}}$, appears merely as a scaling factor of the pathway flux, and for simplicity we set $k_{\mathrm{cat}} = 1\,\mathrm{s}^{-1}$ without loss of generality.
We varied the sizes of the catalyst and its substrate, which we assumed to be both spherical. We considered (i) a metabolic pathway, where substrates have sizes typical for metabolites (r = 0.34 nm) and catalysts have sizes typical for globular proteins (r = 2.4 nm); and (ii) a ribosomal system with sizes resembling those of tRNAs (r = 2.4 nm) and ribosomes (r = 13 nm). We also assumed the catalyst-substrate complex to be spherical, with a volume equal to that of the catalyst plus the substrate. Note that while cells do not contain pathways of consecutive ribosomes, the model for the 20-step linear pathway depicted in Fig 2A is […]
Fig 3A plots the reaction fluxes. In the metabolic system (blue), the total reaction flux increases monotonically with the cytosolic occupancy within the range explored; note that the term that describes the approximate effects of diffusion, exp(−gρ), breaks down close to ρ = 1 [16,30]. In contrast, in the ribosomal system (red), the total reaction flux reaches a maximum at ρ = 0.21, beyond which potential flux increases from ever more highly saturated ribosomes are outweighed by increasingly hindered diffusion. Fig 3B plots the flux per biomass investment, a proxy for growth rate, defined as the reaction fluxes divided by cytosolic occupancy. Here, both systems show a clear optimum, with an optimal occupancy of $\rho_{\mathrm{opt}} = 0.30$ for the metabolic system and $\rho_{\mathrm{opt}} = 0.12$ for the ribosomal system. In the light of natural selection on the cellular growth rate, optimal growth, which occurs at a different occupancy than maximal flux, was likely more relevant.
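The distinction between maximal flux and maximal flux per occupancy can be seen in a deliberately simple toy calculation. This is not the 20-step pathway model: it assumes a single reaction whose catalyst and substrate concentrations both scale with the occupancy ρ, and a purely diffusion-limited Michaelis parameter that grows as exp(gρ). The values `k0 = 0.05` (in occupancy units) and `g = 8` are hypothetical. Under these assumptions the flux per occupancy peaks at ρ = 1/g, while the raw flux keeps rising to a noticeably higher occupancy.

```python
import math


def flux(rho, k0=0.05, g=8.0):
    """Toy flux: catalyst and substrate both scale with rho; K* = k0*exp(g*rho)."""
    return rho * rho / (k0 * math.exp(g * rho) + rho)


def argmax_on_grid(values, grid):
    """Return the grid point at which `values` is largest."""
    best = max(range(len(grid)), key=lambda i: values[i])
    return grid[best]


# Occupancy grid from 0.001 to 0.999 in steps of 0.001.
grid = [i / 1000 for i in range(1, 1000)]

# Occupancy that maximizes the raw flux.
rho_max_flux = argmax_on_grid([flux(r) for r in grid], grid)

# Occupancy that maximizes flux per occupancy (the growth-rate proxy).
rho_max_eff = argmax_on_grid([flux(r) / r for r in grid], grid)

if __name__ == "__main__":
    print("max flux at rho =", rho_max_flux)
    print("max flux per occupancy at rho =", rho_max_eff)
```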
The effect of the occupancy on the pathway output is substantial in the ribosomal system, where a 50% drop in occupancy from $\rho_{\mathrm{opt}}$ decreases pathway flux per dry mass by 21%, while it is small for the metabolic system, where a 50% drop in occupancy incurs a flux decrease per dry mass of only 1.3%. Increasing the number of steps in the pathway from 20 to 100 has almost no effect on the optimal occupancies $\rho_{\mathrm{opt}}$ for both systems; however, the reduction of flux per dry mass when ρ deviates from $\rho_{\mathrm{opt}}$ is substantially increased for longer pathways or more parallel reactions (Fig 3, dashed lines). Assuming that metabolic reactions are more biased towards being transition state limited (green curves in Fig 3) has only very small effects. Moreover, the approach to crowding employed in Vazquez [26] also leads to conclusions that are qualitatively similar to Fig 3 (S1 Fig). We conclude that the model predictions regarding the differences between metabolic and ribosomal systems are robust and do not depend on model details.
[Fig 2 caption, fragment: … and satisfies Eqs (S9), (S10) and (S11). Here $l_T$, $l_M$, and $l_R$ are the number of precursor molecules required to synthesize a transporter protein, a metabolic enzyme, and a ribosome in the model, with values 300, 300, and 7.459, respectively. https://doi.org/10.1371/journal.pcbi.1011177.g002]
We conclude that the results from the pathway model are consistent with the experimentally observed trend that the cytosol of fast-growing bacteria, which is dominated by the larger molecules of the ribosomal sector, favors a lower occupancy than the cytosol of a slowly growing cell dominated by smaller metabolites and enzymes. Moreover, our results suggest that compared to cells dominated by the ribosomal sector, cells dominated by the smaller molecules of the metabolic sector may suffer a smaller reduction in biochemical efficiency when the cytosol shifts away from $\rho_{\mathrm{opt}}$.

Optimal occupancy for a self-replicating cell at balanced growth
Can the single-pathway results be generalized to more realistic models of cellular growth, which combine metabolic and ribosomal activities, and where we can directly assess the effect of concentration changes on cellular growth rates? How does the cytosolic occupancy optimal for growth, $\rho_{\mathrm{opt}}$, change when the number of active metabolic reactions changes, e.g., when switching from a minimal medium, where all biomass components have to be synthesized from a single carbon source through multi-step biochemical pathways, to a rich medium that provides many cellular building blocks through simple transport processes?
To answer these questions, we considered a schematic GBA model of a bacterial cell [13], the cytosol of which comprises interdependent ribosomal and metabolic sectors (Fig 2C). In this model cell, a nutrient $s_{\mathrm{ext}}$ is imported into the cell by transport protein T; the nutrient is then converted into precursor p by an N-step metabolic pathway; finally, the ribosome R uses p to synthesize all catalytic proteins, including T, R, and the metabolic enzymes $M_1$ to $M_N$. For simplicity, all molecules are again assumed to be spherical in shape. All macromolecules, except for the transporter protein T, are located in the cytosol and are thus crowders in their own right; T is assumed to be fully integrated into the membrane and does not contribute to crowding. As before, the molecules of the metabolic sector are small (metabolites $s_i$ with $r_s = 0.34$ nm and metabolic enzymes $M_i$ with $r_M = 2.4$ nm), while the constituents of the ribosomal sector are much larger (precursor p with $r_p = 2.4$ nm and ribosome R with $r_R = 13$ nm).
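How molar concentrations of spherical molecules translate into a volume occupancy can be sketched as follows. The radii are those quoted for the model cell; the concentrations are hypothetical placeholders chosen only to give an occupancy of a plausible magnitude, and the helper names are illustrative rather than taken from the authors' implementation.

```python
import math

AVOGADRO = 6.02214076e23   # molecules per mole
NM3_TO_LITRE = 1e-24       # 1 nm^3 = 1e-27 m^3 = 1e-24 L


def sphere_volume_nm3(radius_nm):
    """Volume of a sphere with the given radius, in nm^3."""
    return 4.0 / 3.0 * math.pi * radius_nm ** 3


def occupancy(conc_molar, radius_nm):
    """Fraction of volume occupied by spherical solutes.

    conc_molar: species name -> concentration in mol/L
    radius_nm:  species name -> radius in nm
    """
    total = 0.0
    for name, conc in conc_molar.items():
        litres_per_molecule = sphere_volume_nm3(radius_nm[name]) * NM3_TO_LITRE
        total += conc * AVOGADRO * litres_per_molecule
    return total


# Radii from the text (nm): metabolite s, enzyme M, precursor p, ribosome R.
radii = {"s": 0.34, "M": 2.4, "p": 2.4, "R": 13.0}

# HYPOTHETICAL concentrations (mol/L), for illustration only.
conc = {"s": 5e-3, "M": 1e-3, "p": 2e-4, "R": 2e-5}

if __name__ == "__main__":
    ratio = sphere_volume_nm3(radii["R"]) / sphere_volume_nm3(radii["M"])
    print("ribosome / enzyme volume ratio:", ratio)
    print("total occupancy:", occupancy(conc, radii))
```

Because volume scales with the cube of the radius, a single ribosome excludes as much volume as roughly 160 enzymes of the assumed size, so small shifts in ribosome concentration dominate the total.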
To estimate the appropriate number of active enzymatic reactions in the metabolic sector, N, we used flux balance analysis constrained by enzyme concentration [35]; simulations were performed with an improved implementation parameterized for a genome-scale model of Escherichia coli metabolism [36]. We found 259 active metabolic enzymes for growth in a minimal medium with glucose as the sole carbon source; 206 active enzymes for the same medium supplemented with amino acids; and 174 active enzymes for growth in a rich medium (Methods).
To facilitate the numerical determination of the state with maximal growth rate, we approximate the metabolic pathway through a single, lumped reaction with catalyst M and substrate s, scaling kinetic parameters and molecular masses to account for the pathway length. The metabolic and ribosomal reactions are described by crowding-adjusted irreversible Michaelis-Menten kinetics (Eq (1)), whereas the transporter reaction is described by conventional irreversible Michaelis-Menten kinetics with constant $K_M^{T}$. We investigated how the cytosolic occupancy that allows the fastest growth varies with two parameters, N and [$s_{\mathrm{ext}}$]. N is the number of enzyme species in the metabolic pathway; it is inversely related to the richness of the nutrient composition and enters the model through scaling the lumped metabolic reaction. [$s_{\mathrm{ext}}$] is the concentration of the external nutrient, which parameterizes the degree to which the available nutrients are limiting growth. As above, we challenged the assumptions of our model by performing a second set of simulations, allowing for a stronger bias of metabolic reactions towards transition state limitation (θ = 4.6) compared to ribosomal reactions (θ = 2.3).

Optimal occupancy is lower at faster growth due to higher protein translation demands
In the whole-cell model, the optimal occupancy $\rho_{\mathrm{opt}}$ generally falls between 0.1 and 0.3 (Fig 4). $\rho_{\mathrm{opt}}$ increases when the number of simultaneously active metabolic reactions N increases or when the external nutrient concentration $s_{\mathrm{ext}}$ (and consequently the growth rate) decreases (Fig 5). These effects are due to shifts in the relative dry mass fractions of the metabolic sector (metabolic enzyme M and the substrate s; favoring higher occupancy) and the ribosomal sector (ribosome R and the precursor p; favoring a relatively lower occupancy). Using different θ values for the metabolic and ribosomal systems does not change the qualitative trends of the model (S2 Fig) […] [1,37]. At optimal occupancy $\rho_{\mathrm{opt}}$ for different N and $s_{\mathrm{ext}}$, the crowding-adjusted Michaelis parameter $K_M^{*}$ of the ribosomal reaction shows a marked increase with $\rho_{\mathrm{opt}}$ (Fig 6; two-sided Spearman rank correlation coefficient r = 0.995, $P < 10^{-15}$), consistent with our observations of a strong dependence of the ribosomal flux on ρ in the simple pathway model (Fig 3). In contrast, the $K_M^{*}$ of the metabolic reactions is almost invariant when plotted against $\rho_{\mathrm{opt}}$ (r = 0.034, P = 0.85). These observations support the intuitive notion that the optimal occupancy in the whole-cell model reflects a tradeoff between the saturation of metabolic enzymes, favoring larger occupancies with higher encounter rates, and the inhibition of the ribosomes, favoring lower occupancies with unhindered diffusion of tRNAs. In agreement with our findings for the simple pathway model, the dependence of the growth rate on ρ appears to be moderate: e.g., at N = 150 and $s_{\mathrm{ext}} = 1$ μM, a 50% reduction of ρ from $\rho_{\mathrm{opt}}$ reduces μ by only 1.1% (Fig 4).
The slow-down of diffusion and the perturbation of Gibbs free energies have opposing effects on reaction efficiencies. To consider these two effects separately, we define the transition state-perturbation only Michaelis parameter as given by Eq (2) when setting the diffusion scaling exponent to g = 0, and we define the diffusion-perturbation only Michaelis parameter as given by Eq (2) when setting the Gibbs perturbation term to Γ = 1. We used these hypothetical Michaelis parameters in renewed simulations of the pathway models of the metabolic and ribosomal systems (Fig 2A and 2B). In the diffusion-perturbation only model, crowding increases $K_M^{*}$ and reduces the reaction fluxes (S7 Fig, dotted lines); in contrast, crowding in the transition state-perturbation only model decreases $K_M^{*}$ and boosts the fluxes (S7 Fig, dashed lines). At high occupancies (ρ ≳ 0.5), the transition state-perturbation effects reach a plateau, whereas the diffusion-perturbation effects continue to increase. Thus, at larger occupancies, the flux-reducing slow-down of diffusion always dominates over the flux-enhancing perturbation of Gibbs free energy when considering their joint effect (S7 Fig, solid lines). In addition, shifting the bias toward stronger transition state perturbation (from solid blue to solid green) delays but does not change the overall trend that $K_M^{*}$ increases with increasing occupancy ρ. Comparison of S7A and S7B Fig shows that these trends are largely independent of N, the number of reactions in the system. While the trend for the slow-down of diffusion depends on the sizes of the reacting molecules, the trend for the perturbation of Gibbs free energies is largely independent of molecule sizes.

Optimal occupancy is quantitatively consistent with the observed E. coli dry mass density
To compare our predictions of optimal occupancy to experimental data from E. coli, we first consider the transition from slow to fast growth on minimal media (simulated extracellular nutrient concentration $s_{\mathrm{ext}} = 0.1$ μM vs. 1 μM, at a constant number of active metabolic reactions N = 250). Here, the optimal cytosolic volume occupancy $\rho_{\mathrm{opt}}$ predicted by our whole-cell model decreases from 0.250 down to 0.234 (a 7% drop, Fig 5).
While no direct experimental observations of the occupancy ρ are available, occupancy is expected to scale approximately in proportion to the cytosolic dry mass density, $\rho_{\mathrm{DM}}$ [31]. Oldewurtel et al. traced $\rho_{\mathrm{DM}}$ along the growth trajectories of wildtype E. coli cells (MG1655) […] Table shows the results of Wilcoxon rank sum tests that compare the median mass densities across conditions. The measurements indicate that $\rho_{\mathrm{DM}}$ does not vary noticeably with μ across most minimal media, i.e., $\rho_{\mathrm{DM}} \approx 0.31$ g/mL when $\mu \le 0.7\,\mathrm{h}^{-1}$. While the mass density in growth on minimal medium is statistically significantly different from that in the other two minimal media conditions, the observed $\rho_{\mathrm{DM}}$ distributions are strongly overlapping (S8A Fig). In contrast, the observed distribution of mass density drops to a much lower mean value in rich medium, $\rho_{\mathrm{DM}} = 0.28$ g/mL, where $\mu = 1.2\,\mathrm{h}^{-1}$. To compare these data to our predictions, we have to convert mass densities to occupancy values (volume density). As cellular resources are shifted from protein to ribosomal RNA and tRNA with increasing growth rate, and because RNA is denser than protein, the conversion factor between dry mass and volume changes with growth rate (Methods). Taking this into account, we find that the experimental data indicate a decrease of ρ from 0.223 to 0.215 (S8B Fig and S1B Table), a 4% reduction that is consistent with our prediction of 7% (Fig 5). Second, we consider the transition from fast growth on minimal media to ultrafast growth in rich medium (simulated extracellular nutrient concentration $s_{\mathrm{ext}}$ of 1 μM vs. 10 μM, number of active metabolic reactions N = 250 vs. N = 150). Here, the predicted optimal cytosolic volume occupancy $\rho_{\mathrm{opt}}$ decreases by 15%, from 0.234 to 0.204 (Fig 5). Empirical observation shows that $\rho_{\mathrm{DM}}$ in E. coli decreases from 0.31 g/mL to 0.28 g/mL as μ increases from $0.7\,\mathrm{h}^{-1}$ to $1.2\,\mathrm{h}^{-1}$ (S8A Fig and S1A Table) [10], corresponding to a reduction in occupancy from ρ = 0.215 to ρ = 0.191, an 11% decrease (S8B Fig and S1B Table). Thus, the predicted 15% change in $\rho_{\mathrm{opt}}$ also appears to be consistent with the empirically observed reduction in occupancy in the transition to ultrafast growth.
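The mass-density and occupancy values quoted above imply a conversion factor (volume per unit dry mass) for each condition, which can be recovered with a few lines of arithmetic. The sketch below uses only the rounded numbers from the text; the dictionary keys are illustrative labels, and the actual conversion used in the paper is described in its Methods.

```python
def relative_drop(before, after):
    """Fractional decrease from `before` to `after`."""
    return (before - after) / before


# Rounded values quoted in the text for fast growth on minimal medium
# (mu = 0.7 per hour) and for rich medium (mu = 1.2 per hour).
rho_dm = {"minimal": 0.31, "rich": 0.28}    # dry mass density, g/mL
rho = {"minimal": 0.215, "rich": 0.191}     # volume occupancy, dimensionless

# Implied volume per gram of dry mass (mL/g) in each condition.
specific_volume = {key: rho[key] / rho_dm[key] for key in rho}

if __name__ == "__main__":
    for key in ("minimal", "rich"):
        print(key, "implied mL per g dry mass:", round(specific_volume[key], 3))
    print("drop in mass density:", round(relative_drop(0.31, 0.28), 3))
    print("drop in occupancy:", round(relative_drop(0.215, 0.191), 3))
```

The implied volume per gram is slightly smaller in rich medium, in line with the statement that dry mass shifts toward RNA, which is denser than protein; this is why the occupancy falls by a somewhat larger fraction than the mass density.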
Our simulations also show that even if the cell remains at the state of optimal occupancy, a large reduction in the nutrient level results in only a small decrease of the ribosome's saturation with its substrate (0.81 to 0.72, S5B Fig); at the same time, the ribosome's Michaelis parameter $K_M^{*}$ increases by 51% (S5C Fig). This finding is consistent with experimental observations that changes in translation rate per ribosome in E. coli, a direct consequence of ribosomal substrate saturation, are much smaller than the simultaneous changes in growth rate [38]. At increasing nutrient levels, a higher growth rate μ is facilitated by relocating a substantial amount of the ribosome's synthesis capacity from transporter proteins to metabolic enzymes and additional ribosomes (S4C Fig).

Discussion
The linear pathway model shows that reactions with larger catalysts and substrates favor a lower occupancy than reactions with smaller molecules (Fig 3). This effect explains the observation of lower optimal occupancies for decreasing pathway length N in the GBA model cell: the decrease in pathway length simulates the switch from minimal media, requiring on the order of 260 metabolic enzyme species to convert a small number of nutrients to the full range of cellular building blocks, to increasingly richer media, where progressively more biomass components can be taken up directly from the environment, requiring as few as 140 metabolic enzyme species. With decreasing numbers of active metabolic reactions N, the ribosomal sector expands at the expense of the metabolic sector, pushing $\rho_{\mathrm{opt}}$ to lower values (S4A Fig). Given the minimalistic nature of the whole-cell model, which assumes that all metabolic reactions follow identical kinetics and approximates protein production through a single Michaelis-Menten type reaction, it is striking that the model not only predicts differences across physiological states, but predicts experimentally observed values [10] quantitatively with an error below 12%.
The whole-cell model predicts that the growth rate μ decreases only mildly when ρ deviates from $\rho_{\mathrm{opt}}$; for example, μ decreases by only 3% when ρ increases to twice the optimal occupancy $\rho_{\mathrm{opt}}$ (Fig 4; N = 150, $s_{\mathrm{ext}} = 1.0$ μM). This observation is consistent with an experiment that arrested the volume growth of a yeast cell while cytosolic dry mass continued to accumulate, increasing the concentration of a fluorescent protein (a proxy for dry mass density) to roughly twice the wildtype value [39]. These results suggest that the dry mass accumulation rate largely remains constant throughout the volume growth arrest, even when the cytosol density reaches twice the wildtype level.
In the whole-cell model, a 10% deviation of ρ from $\rho_{\mathrm{opt}}$ results in a 0.02% drop in μ (Fig 4); using growth rate as a proxy for fitness, this corresponds to a selection coefficient $s = 2 \times 10^{-4}$. The effective population size of most bacterial species is on the order of $N_e = 10^{8}$ [40], and we thus have $s \gg 1/N_e = 10^{-8}$; accordingly, natural selection would be sufficiently strong to explain the difference in average $\rho_{\mathrm{DM}}$ observed between the two physiological states. Experiments show substantial between-cell variation in each nutritional condition (S8 Fig) [10]: the observed dry mass densities show coefficients of variation (standard deviation / mean) of around 5%. According to the whole-cell model, the corresponding difference in occupancy corresponds to a selection coefficient of $s = 10^{-4} \gg 1/N_e$, indicating that this large cell-to-cell variation is not selectively neutral but persists despite negative selection arising from selection on rapid growth.
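The comparison between selection and drift made above amounts to checking whether the product of the selection coefficient and the effective population size is much larger than one. A minimal sketch with the numbers from the text follows; the function name is illustrative.

```python
def scaled_selection(s, n_e):
    """Product N_e * s; values far above 1 mean selection dominates drift."""
    return s * n_e


N_E = 1e8                 # effective population size quoted in the text
S_DEVIATION = 2e-4        # 0.02% growth-rate drop for a 10% occupancy deviation
S_CELL_VARIATION = 1e-4   # selection coefficient quoted for ~5% cell-to-cell variation

if __name__ == "__main__":
    print("drift threshold 1/N_e:", 1.0 / N_E)
    print("N_e * s (10% deviation):", scaled_selection(S_DEVIATION, N_E))
    print("N_e * s (cell-to-cell variation):", scaled_selection(S_CELL_VARIATION, N_E))
```

Both products are on the order of 10^4, so even growth-rate differences of a few hundredths of a percent are visible to selection at these population sizes.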
Thus, while our deterministic model predicts some variability, it does not predict variation of the observed magnitude. It thus appears that additional factors beyond selection for fast growth modify the density of individual cells either actively, through regulation, or passively, through the coupling of mass density to other factors. However, these observations do not invalidate our model, but rather indicate that other factors can be seen as perturbations of the state optimal for rapid growth, because our results are consistent with the average densities of cells across different growth conditions. One such influence may arise from the coupling of cell size to DNA replication. As growth rate increases, the number of replication rounds within a cell also increases (see Fig 9 of Ref. [41] for an illustration), and additional intracellular volume may simply be required to accommodate the parallel replication processes.
Questions about the optimal allocation of protein resources can be addressed by maximizing the growth rate (or flux) in computational models of cellular growth with fixed, crowding-unaware kinetic parameters. While such simulations produce meaningful predictions for the relative amounts of the proteins, they cannot limit absolute protein concentrations [42]. To solve this problem, existing models [13,14] implement hard, phenomenological constraints on the total concentration of protein or cellular dry mass, based on experimental observations that found these to be approximately constant across growth conditions [10,37,43,44]. Our results elucidate the biophysical origin of these observations, indicating that the cellular dry mass density represents a compromise between the saturation of metabolic enzymes with their substrates and the effects of reduced diffusion on the effective affinity of the ribosome for its much larger substrate, the ternary complex.

Crowding-adjusted Michaelis-Menten kinetics
Macromolecular crowding affects the flux of a metabolic reaction in multiple ways. It can (i) slow down diffusion; (ii) affect the free energy of substrate, catalyst, and the substrate-catalyst complex and thereby change their relative equilibrium ratios; and (iii) disturb the folding of a protein and affect the shape of the active site. In our modelling framework, we followed the derivation proposed in Minton [25] that systematically accounts for the effects of crowding on metabolic fluxes caused by effects (i) and (ii).
In this section, we consider a metabolic reaction carried out by an enzyme E that converts substrate S into product P, in the presence of other volume-excluding co-solutes that collectively constitute the dry mass of the solution. The reaction follows Michaelis-Menten kinetics and is described by the chemical equation
$$E + S \underset{k_{\mathrm{off}}}{\overset{k_{\mathrm{on}}}{\rightleftharpoons}} ES \xrightarrow{k_{\mathrm{cat}}} E + P.$$
It is described by two parameters: the enzyme-substrate dissociation (or Michaelis) parameter $K_M \equiv k_{\mathrm{off}}/k_{\mathrm{on}}$, and the catalytic rate constant (or turnover number) $k_{\mathrm{cat}}$. Note that while $K_M$ is usually assumed to be invariable and is hence referred to as the "Michaelis constant", we here examine crowding-dependent changes in $K_M$ and hence refer to it as the "Michaelis parameter". $K_M$ is sometimes defined as $K_M = (k_{\mathrm{off}} + k_{\mathrm{cat}})/k_{\mathrm{on}}$. However, we here use the definition $K_M = k_{\mathrm{off}}/k_{\mathrm{on}}$, as this is the appropriate form when working with complex enzyme reactions [45].
For example, in a metabolic reaction with two substrates, A and B, the binding of the substrates to the enzyme involves the kinetic parameters $k^A_{\mathrm{on}}$, $k^A_{\mathrm{off}}$, $k^B_{\mathrm{on}}$, and $k^B_{\mathrm{off}}$. The concentration of the enzyme complex formed after both substrates have bound to the enzyme can be described in terms of the dissociation parameters $K^A_M$ and $K^B_M$ alone, without directly involving the on- and off-rates. Thereafter, the flux of the reaction can be calculated by multiplying the complex concentration by $k_{\mathrm{cat}}$. Including $k_{\mathrm{cat}}$ in $K^A_M$ and $K^B_M$ would complicate the derivation. For this reason, $k_{\mathrm{cat}}$ is not included in $K_M$ in Minton [25], and we follow this convention in the current study.
We assume that all reactions follow effectively irreversible Michaelis-Menten kinetics, so that free energy changes are irrelevant to the transition rate $k_{\mathrm{cat}}$; moreover, we ignore crowding effects on the enzyme structure, and thus $k_{\mathrm{cat}}$ is not affected by crowding. To derive the effect of crowding on the Michaelis parameter $K_M$, we divide a catalytic reaction into two independent, consecutive steps: (1) S and E diffuse until they meet, and then (2) S and E bind and unbind reversibly until the reaction proceeds forward to make P.

Step (1): The substrate-catalyst encounter
The encounter rate between S and E depends on the cytosolic occupancy. Its rate law is similar to that of a diffusion-limited catalytic reaction, in which encounters between E and S are rare and form the rate-determining step; as soon as the ES complex is formed, the reaction quickly proceeds, converting S into P and releasing E. At given concentrations of E and S, the rate of formation of ES is thus mostly determined by the rate of encounter, which is proportional to the sum of E's and S's diffusion coefficients. A reduction of the diffusion rate shifts the equilibrium between the concentrations of the individual molecules and the complex ES; this can be accounted for in the following way [25]:
$$K_M^{\mathrm{diff}} = \frac{K_M^0}{\exp(-g\rho)};$$
here, $K_M^{\mathrm{diff}}$ is the hypothetical diffusion-limited Michaelis parameter, while $K_M^0$ is the Michaelis parameter in the low-crowding limit; ρ is the volume occupancy of the volume-excluding co-solutes (dry mass) of the solution (with range 0 < ρ < 1), and g is a function that depends on the shape of E, S, and other volume-excluding co-solutes. Since S is typically much smaller than E, the diffusion coefficient of S in a crowded solution is in general much higher than that of E. Hence, we estimate the scaling term $\exp(-g\rho)$ solely from the diffusion coefficient of S. Approximating S as a sphere of radius $r_S$, we can write it as $\exp(-g(r_S)\rho)$.
The bacterial cytosol is crowded, slowing down the diffusion of all molecular species. The extent of this slow-down, however, is non-uniform and depends largely on the size of the affected molecule: the larger the molecule, the more it is slowed down. The slow-down of diffusion in the E. coli cytosol is summarized by the following empirical scaling law, which was inferred from molecular dynamics simulations [23]:
$$\frac{D_{\mathrm{cyto}}(r_h)}{D_0(r_h)} = \exp\left[-\left(\frac{\xi^2}{R^2} + \frac{\xi^2}{r_h^2}\right)^{-a/2}\right], \quad (S2)$$
where $r_h = 1.3\,(r + 1.4\,\text{Å})$ is the hydrodynamic radius of a molecule with radius r [46], i.e., its effective radius including the attached water molecules; $D_0(r_h)$ is the diffusion coefficient in the low-crowding limit, while $D_{\mathrm{cyto}}(r_h)$ is the diffusion coefficient in the crowded cytosol; ξ = 0.51 nm is the average distance between the surfaces of volume-excluding co-solutes in the cytosol of E. coli; R = 42 nm is the radius of the largest common crowders in the cytosol; and a = 0.53 is an empirical scaling factor [23]. Note that in general, the parameter ξ depends on ρ; however, as we are only interested in the relationship between reaction fluxes, growth rate, and ρ in a small range centered around the native E. coli cytosolic density, we can approximate ξ by a constant in our analysis. As the cytosolic dry mass density is ~0.3 g/mL [10] and the mass-to-volume ratio of protein is 1.35 g/mL [47], the cytosolic volume occupancy ρ is approximately 0.22 = 0.3/1.35. If the reaction rate is proportional to the rate of encounter between E and S, and this rate of encounter is in turn approximately proportional to the diffusion coefficient of S, then Eq (S2) can be used to calculate the scaling factor $g(r_S)$:
$$g(r_S) = \frac{1}{0.22}\left(\frac{\xi^2}{R^2} + \frac{\xi^2}{r_h(r_S)^2}\right)^{-a/2},$$
and the rate parameter of this reaction, denoted $k_{\mathrm{diff}}$, scales as
$$k_{\mathrm{diff}} \propto \exp(-g(r_S)\,\rho).$$
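The size dependence of the diffusion slow-down can be illustrated numerically. The sketch below assumes that the scaling law has the form D_cyto/D_0 = exp(-(ξ²/R² + ξ²/r_h²)^(-a/2)) with the parameter values quoted in the text, and that g(r) is defined so that exp(-g ρ) reproduces this slow-down at the native occupancy ρ ≈ 0.22. Function and constant names are illustrative, not taken from the study's code.

```python
import math

XI_NM = 0.51         # mean surface-to-surface distance between crowders [nm]
R_CROWDER_NM = 42.0  # radius of the largest common crowders [nm]
A_EXPONENT = 0.53    # empirical scaling factor
RHO_NATIVE = 0.22    # approximate native occupancy of the E. coli cytosol


def hydrodynamic_radius(radius_nm):
    """r_h = 1.3 (r + 1.4 Angstrom), with 1.4 Angstrom = 0.14 nm."""
    return 1.3 * (radius_nm + 0.14)


def diffusion_slowdown(radius_nm):
    """D_cyto / D_0 for a sphere of the given radius (assumed form of the law)."""
    r_h = hydrodynamic_radius(radius_nm)
    base = XI_NM**2 / R_CROWDER_NM**2 + XI_NM**2 / r_h**2
    return math.exp(-(base ** (-A_EXPONENT / 2.0)))


def g_factor(radius_nm, rho_native=RHO_NATIVE):
    """g(r) such that exp(-g * rho_native) equals the slow-down at native occupancy."""
    return -math.log(diffusion_slowdown(radius_nm)) / rho_native


for name, radius in [("metabolite", 0.34), ("ternary complex", 2.4)]:
    print(f"{name}: D_cyto/D_0 = {diffusion_slowdown(radius):.3f}, "
          f"g = {g_factor(radius):.2f}")
```

Under these assumptions, the larger ternary complex receives a markedly larger g than a small metabolite, which is the asymmetry that the two-sector model relies on.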

Step (2): Repeated binding and unbinding of substrate and catalyst
The rate law of this step is related to a transition-state limited catalytic reaction, in which the rate of encounter of E and S is much larger than the rate of conversion of the ES complex into the product P. In this case, the complex ES exists in near equilibrium with E and S, and macromolecular crowding affects the reaction rate parameter mainly through shifting this equilibrium. To assess the magnitude of this effect, we consider the reversible reaction
$$E + S \underset{k_{\mathrm{off}}}{\overset{k_{\mathrm{on}}}{\rightleftharpoons}} ES$$
and denote the adjusted equilibrium Michaelis parameter in this hypothetical transition-state limited case as $K_M^{\mathrm{ts}}$ [25]:
$$K_M^{\mathrm{ts}} = \frac{K_M^0}{\Gamma},$$
where $\gamma_i$ denotes the activity coefficient of molecular species i, $\Gamma = \gamma_E \gamma_S / \gamma_{ES}$, and $K_M^0 = k_{\mathrm{off}}/k_{\mathrm{on}}$ is the enzyme-substrate dissociation parameter in the low-crowding limit. The activity coefficients are defined as
$$\ln \gamma_i = \frac{\mu_i - \mu_i^{\mathrm{ideal}}}{RT}, \qquad \mu_i = \frac{\partial G}{\partial [i]};$$
here, G is the Gibbs free energy of the system; [i] is the concentration of molecular species i; $\mu_i$ is the chemical potential of i, and $\mu_i^{\mathrm{ideal}}$ is the chemical potential in an idealized situation, i.e., without intermolecular interactions and in the absence of other volume-excluding co-solutes; RT is the product of the gas constant and the absolute temperature. In other words, the equilibrium constant of the reaction will be identical to the dissociation parameter $K_M^0$ if the system is ideal. The Γ term, therefore, accounts for the deviation of the Gibbs free energy from the idealized situation.
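The $\gamma_i$ of each molecular species i can be written as an expansion in terms of the concentrations of all molecular species [48]:
$$\ln \gamma_i = \sum_j B_{ij}[j] + \sum_j \sum_k B_{ijk}[j][k] + \dots$$
The coefficients $B_{ij}$ ($B_{ijk}$, ...) reflect the interaction between 2 (3, ...) molecular species. For example, the coefficient $B_{ij}$ is given as [49]:
$$B_{ij} = 4\pi N_A \int_0^\infty \left[1 - \exp\left(-\frac{U_{ij}(r)}{k_B T}\right)\right] r^2 \, dr,$$
where $N_A$ is Avogadro's number, r is the distance between the centers of mass of molecular species i and j, $U_{ij}(r)$ is the potential of average force acting between the two molecular species, and $k_B T$ is the thermal energy. While the interaction potential among multiple molecular species is complex, it has been found that the colligative properties of solutions of globular proteins can be well accounted for over a wide range of concentrations by using a simple hard sphere potential, where two molecules cannot overlap but do not interact otherwise (see Minton [25] for a review):
$$U_{ij}(r) = \begin{cases} \infty, & r \le r_i + r_j \\ 0, & r > r_i + r_j \end{cases}$$
The scaled particle theory applies this rigid sphere potential to calculate the activity coefficient $\gamma_i$ of molecular species i (Boublík, 1974):
$$\ln \gamma_i = -\ln\left(1 - \langle\langle V \rangle\rangle\right) + \frac{r_i \langle\langle S \rangle\rangle + S_i \langle\langle r \rangle\rangle + V_i \langle\langle 1 \rangle\rangle}{1 - \langle\langle V \rangle\rangle} + \frac{r_i^2 \langle\langle S \rangle\rangle^2 + 2 V_i \langle\langle r \rangle\rangle \langle\langle S \rangle\rangle}{2\left(1 - \langle\langle V \rangle\rangle\right)^2} + \frac{V_i \langle\langle r^2 \rangle\rangle \langle\langle S \rangle\rangle^2}{3\left(1 - \langle\langle V \rangle\rangle\right)^3}.$$
Here, $w_i$ is the concentration (number density) of molecular species i; $\langle\langle X \rangle\rangle = \sum_i w_i X_i$ is a weighted sum of property X, with $\langle\langle 1 \rangle\rangle = \sum_i w_i$; $S_i = 4\pi r_i^2$ is the surface area and $V_i = \frac{4}{3}\pi r_i^3$ is the volume of molecular species i; in addition, $\langle\langle r \rangle\rangle = \sum_i w_i r_i$ and $\langle\langle r^2 \rangle\rangle = \sum_i w_i r_i^2$. As in the hypothetical transition-state limited case, the reaction rate parameter is proportional to the concentration of the enzyme-substrate complex, and a shift of its equilibrium constant by Γ leads to a corresponding shift of the reaction rate, quantified by the rate parameter $k_{\mathrm{ts}}$ [25]:
$$k_{\mathrm{ts}} = k_{\mathrm{ts}}^0 \, \Gamma,$$
where $k_{\mathrm{ts}}^0$ is the rate parameter in the low-crowding limit.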
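To make the hard-sphere treatment concrete, the following sketch evaluates activity coefficients with the standard scaled particle theory result for hard-sphere mixtures, written in terms of the size moments ξ_k = (π/6) Σ_j n_j d_j^k. This is the textbook form of the theory and may be arranged differently from the expression used in the study. The function names and the single-crowder example (crowders of radius 2.4 nm filling 22% of the volume) are illustrative assumptions.

```python
import math


def spt_ln_gamma(r_i, crowder_radii, crowder_densities):
    """ln(gamma_i) of a hard sphere of radius r_i [nm] in a hard-sphere mixture.

    crowder_densities are number densities in nm^-3.
    """
    # xi[k] = (pi/6) * sum_j n_j * d_j**k, with d_j the crowder diameters
    xi = [math.pi / 6.0 * sum(n * (2.0 * r) ** k
                              for r, n in zip(crowder_radii, crowder_densities))
          for k in range(4)]
    free = 1.0 - xi[3]  # xi[3] is the occupied volume fraction
    d = 2.0 * r_i
    # Work against the osmotic pressure when inserting the sphere volume
    pv_term = (xi[0] / free + 3.0 * xi[1] * xi[2] / free**2
               + 3.0 * xi[2] ** 3 / free**3) * d**3
    return (-math.log(free)
            + 3.0 * xi[2] * d / free
            + (3.0 * xi[1] / free + 4.5 * xi[2] ** 2 / free**2) * d**2
            + pv_term)


def crowding_gamma(r_e, r_s, crowder_radii, crowder_densities):
    """Gamma = gamma_E * gamma_S / gamma_ES, with V_ES = V_E + V_S."""
    r_es = (r_e**3 + r_s**3) ** (1.0 / 3.0)

    def ln_gamma(radius):
        return spt_ln_gamma(radius, crowder_radii, crowder_densities)

    return math.exp(ln_gamma(r_e) + ln_gamma(r_s) - ln_gamma(r_es))


# Illustrative assumption: one crowder species (r = 2.4 nm) filling 22% of the volume
occupancy, r_crowder = 0.22, 2.4
density = occupancy / (4.0 / 3.0 * math.pi * r_crowder**3)
print(crowding_gamma(2.4, 0.34, [r_crowder], [density]))
```

In this example Γ comes out larger than one, reflecting that crowding favors the more compact bound state over the separated enzyme and substrate.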

Combining step (1) and (2) for the general formulation
We assume that a catalytic reaction is divided into two independent steps that occur in tandem: (1) the encounter of S and E, whose rate law is proportional to that of a diffusion-limited reaction, and thereafter (2) their reversible binding and unbinding until S is converted into P, whose rate law is proportional to that of a transition-state limited reaction. The crowding-adjusted reaction rate k is obtained by adding the inverse rate parameters (i.e., the reaction times) of the two steps [25,30,50]:
$$\frac{1}{k} = \frac{1}{k_{\mathrm{diff}}} + \frac{1}{k_{\mathrm{ts}}}.$$
Given Eqs (S1) and (S4), this leads to [30]:
$$k = k^0 \, \frac{(1 + \theta)\,\Gamma}{1 + \theta\,\Gamma \exp(g\rho)},$$
where $k^0$ is the rate parameter in the low-crowding limit. Here, $\theta \equiv \left. k_{\mathrm{ts}}/k_{\mathrm{diff}} \right|_{\rho = 0}$ quantifies the relative contributions of steps (1) and (2) to the overall reaction, or in other words the ratio between the times spent on step (1) and on step (2). From now on, we describe the reaction rate as the flux of the reaction, and denote it as v. As we assume that the reaction is irreversible and hence crowding does not influence $k_{\mathrm{cat}}$, to arrive at crowding-adjusted Michaelis-Menten kinetics, we have to scale the Michaelis parameter, which arises from considerations on the equilibrium between the enzyme-substrate complex and its constituents. Thus, with $K_M^0$ the Michaelis parameter in the low-crowding limit, the crowding-adjusted Michaelis parameter is
$$K_M^* = K_M^0 \, \frac{1 + \theta\,\Gamma \exp(g\rho)}{(1 + \theta)\,\Gamma},$$
and so the flux can be written as
$$v = k_{\mathrm{cat}} [E] \frac{[S]}{K_M^* + [S]}.$$
For the ERK MAP kinase phosphorylation reaction, θ was estimated to be 2.3 [30]; we assume that this value is representative for cellular enzymes and use it for the modeled reactions. To understand how the choice of θ affects the model behavior, we also ran a model where metabolic reactions have θ = 4.6.
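The combination of the two steps can be sketched numerically. The code below assumes that k_diff scales as exp(-g ρ), that k_ts scales as Γ, that θ is the ratio k_ts/k_diff at ρ = 0, and that the Michaelis parameter scales inversely with the combined rate parameter k. Under these assumptions, adding the two reaction times gives k/k⁰ = (1 + θ) Γ / (1 + θ Γ exp(g ρ)). The function names and the example values of Γ and g are placeholders, not values from the study.

```python
import math


def relative_rate(rho, gamma, g, theta):
    """k / k0 from 1/k = 1/k_diff + 1/k_ts under the assumed scalings."""
    return (1.0 + theta) * gamma / (1.0 + theta * gamma * math.exp(g * rho))


def adjusted_michaelis_parameter(km0, rho, gamma, g, theta):
    """Assumption: K_M scales inversely with the combined rate parameter k."""
    return km0 / relative_rate(rho, gamma, g, theta)


def michaelis_menten_flux(kcat, enzyme, substrate, km):
    """Irreversible Michaelis-Menten flux with a given Michaelis parameter."""
    return kcat * enzyme * substrate / (km + substrate)


# Placeholder inputs: gamma and g would come from the crowding calculations
km_star = adjusted_michaelis_parameter(km0=130.0, rho=0.22, gamma=1.4, g=5.0, theta=2.3)
print(km_star, michaelis_menten_flux(kcat=13.7, enzyme=1.0, substrate=100.0, km=km_star))
```

Two limits serve as sanity checks: without crowding (ρ = 0, Γ = 1) the Michaelis parameter is unchanged, and for θ = 0 the rate is scaled by Γ alone.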
The concentration of the enzyme-substrate complex, [ES], depends on the crowding-adjusted Michaelis parameter $K_M^*$; at the same time, $K_M^*$ depends on the concentrations of the different molecular species, including [ES], through Eq (S3). To find a self-consistent solution for these two quantities, we iterate these two equations until convergence. Initially, we are given the total concentrations of the enzyme and substrate, $[E_{\mathrm{total}}]$ and $[S_{\mathrm{total}}]$, which are invariant throughout the iterations. The level of crowding is specified by the concentrations of three molecular species, $[E_{\mathrm{free}}]$, $[S_{\mathrm{free}}]$, and $[ES]$; let $[E^n_{\mathrm{free}}]$, $[S^n_{\mathrm{free}}]$, and $[ES^n]$ denote their values in the n-th iteration step. At the initial iteration step (n = 0), we set E 0
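The self-consistent iteration can be sketched as a fixed-point loop. The code below is a minimal illustration under stated assumptions: the complex concentration follows from the dissociation equilibrium [E_free][S_free]/[ES] = K_M together with mass conservation, the crowding dependence of K_M is passed in as a hypothetical callable `km_of_state`, and the loop starts from [ES] = 0 (the starting values used in the study are not reproduced here).

```python
import math


def complex_concentration(e_total, s_total, k_m):
    """[ES] solving [E_free][S_free]/[ES] = K_M with conserved totals."""
    b = e_total + s_total + k_m
    # The smaller root of the quadratic is the physically meaningful one
    return 0.5 * (b - math.sqrt(b * b - 4.0 * e_total * s_total))


def self_consistent_state(e_total, s_total, km_of_state, tol=1e-10, max_iter=1000):
    """Iterate [ES] and K_M until both are mutually consistent."""
    es = 0.0  # assumed starting guess: no complex formed yet
    for _ in range(max_iter):
        k_m = km_of_state(e_total - es, s_total - es, es)
        es_new = complex_concentration(e_total, s_total, k_m)
        if abs(es_new - es) <= tol * max(1.0, es_new):
            return es_new, k_m
        es = es_new
    raise RuntimeError("fixed-point iteration did not converge")


def toy_km(e_free, s_free, es):
    """Toy crowding dependence (hypothetical): K_M rises mildly with [ES]."""
    return 130.0 * (1.0 + 0.002 * es)


es, k_m = self_consistent_state(50.0, 200.0, toy_km)
print(es, k_m)
```

In an actual calculation, `km_of_state` would compute the occupancy from the three concentrations and return the crowding-adjusted Michaelis parameter.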

Single-pathway model
We considered a simple model of a linear pathway to investigate how the sizes of the substrate and catalyst of a metabolic reaction affect the optimal cytosolic occupancy; here, optimality is defined as a maximal pathway flux per unit dry mass, calculated from crowding-adjusted kinetics. The pathway is divided into N steps, where $E_n$ is the catalyst of step n, which converts its substrate $s_n$ into $s_{n+1}$, the substrate of the next step (Fig 2A). We assume that all internal metabolite concentrations are in steady state (i.e., producing and consuming fluxes cancel exactly), and we ignore the dilution of intermediates. Thus, all reaction fluxes have the same value, v. We assume that $s_1$ is replenished by a flux $v_{s_1} = v$, which is not modeled explicitly.
We assume that the N reactions are described by crowding-adjusted Michaelis-Menten kinetics with identical $k_{\mathrm{cat}}$ and $K_M^0$. We further assume that the N catalyst species and the N substrate species are spherical, with radius $r_E$ (volume $V_E$) for the catalysts and radius $r_s$ (volume $V_s$) for the substrate species. These assumptions simplify the solution space of the model, as in the optimal steady state, all catalysts have equal total concentrations. The occupancy ρ follows from the molar concentrations and molecular volumes of all species, with the Avogadro number $N_A$. The flux through the pathway per unit volume is v; accordingly, the flux per unit dry mass is the specific flux μ (Eq (S8)). As we ignore crowding effects on the turnover number, $k_{\mathrm{cat}}$ acts only as a scaling factor, and we thus set $k_{\mathrm{cat}} = 1$ for simplicity. We set the Michaelis parameter in the low-crowding limit to $K_M^0 = 130\,\mu\mathrm{M}$, which is the median $K_M$ of metabolic enzymes [33], and is also very close to the Michaelis constant of the ribosome estimated from the diffusion limit without molecular crowding [34].
Because we assume identical kinetics of all reactions and ignore the dilution of intermediates through growth, the whole pathway is equivalent to a single reaction with re-scaled kinetics. We still describe it as an N-step pathway, as this more faithfully reflects the situation in the real cell, allowing us to use realistic parameter values. Moreover, while N mathematically represents a scaling factor, it has an intuitive biological interpretation.
The same equations can be used to describe a system of N parallel enzymatic reactions with identical fluxes, with only an additional multiplication by N in Eq (S8), μ parallel = μ N (Fig 2B). Here, catalyst E n of reaction n converts substrate s n into the end product, and the consumption of s n is compensated by a flux v s n that supplies s n at an equal rate.
We consider two systems of substrate and catalyst sizes: metabolic and ribosomal. In the metabolic system, we use $r_s$ = 0.34 nm for metabolites (the approximate radius of the amino acid alanine [51]) and $r_E$ = 2.4 nm for globular proteins (an "average" protein in E. coli has a mass of 40 kDa [3], while a globular protein with a mass of 50 kDa has a radius r = 2.4 nm [52]; in an alternative estimation, the radius of a typical globular protein is approximately r = 2.5 nm [53]). In the ribosomal system, we use $r_s$ = 2.4 nm for the ternary complexes (the radius of gyration of tRNA is estimated to range from 2.33 nm to 2.46 nm based on Eq. (7) of Hyeon et al. [54]). We use a radius of $r_E$ = 13 nm for the ribosome, as the diameter of a ribosome is reported to be 20 nm to 30 nm [53,55,56]. We assume that the catalyst-substrate complex is also spherical, with a volume equal to the sum of the substrate's volume and the catalyst's volume. For both metabolic and ribosomal systems, we used crowding-adjusted Michaelis-Menten kinetics with θ = 2.3; to explore the effect of assuming the same θ value for both systems, we alternatively examined a model where we set θ to 4.6 for the metabolic system, as the metabolic system may have a higher diffusion efficiency than the ribosomal system.
We calculated the specific flux μ using MATLAB for each combination of (i) the total occupancy ρ, varied in steps of 0.01 from 0.01 to 0.8, with additional, finer-grained steps of 0.001 from 0.100 to 0.360, and (ii) the fraction of the occupied volume taken up by the substrates s, varied from 0.1% to 97.7%, with an increase by a factor of 1.0023 at each step.
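The parameter scan described above can be set up as two one-dimensional grids. The original calculations were done in MATLAB; the sketch below is a Python illustration of the same grid layout, with hypothetical function names, and it builds the occupancy grid from integers to avoid floating-point duplicates.

```python
def occupancy_grid():
    """Occupancies 0.01..0.80 (step 0.01) merged with 0.100..0.360 (step 0.001)."""
    coarse = range(10, 801, 10)  # in units of 0.001
    fine = range(100, 361)       # in units of 0.001
    return [k / 1000.0 for k in sorted(set(coarse) | set(fine))]


def substrate_fraction_grid(start=0.001, stop=0.977, factor=1.0023):
    """Geometric grid of the substrate share of the occupied volume."""
    fractions = []
    value = start
    while value <= stop:
        fractions.append(value)
        value *= factor
    return fractions


rho_values = occupancy_grid()
fractions = substrate_fraction_grid()
print(len(rho_values), len(fractions))
```

Each pair of values from the two grids defines one evaluation of the specific flux.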

Single-pathway model: The Vazquez approach
In Vazquez [26], reactions are classified into two contrasting types: (1) those in the saturation limit, with $[S] \gg K_M$, and (2) those in the diffusion limit, with $[S] \ll K_M$. The rate of reactions in the diffusion limit is modified by an exponential term, $\exp(-5.8\rho)$, where $0 \le \rho \le 1$ is the cytosolic occupancy. In addition, crowding increases the contact between enzymes and substrates, and so the reactions are sped up by the term $1/(1 - \rho)$. Overall, the rate of a reaction in saturation is scaled by $1/(1 - \rho)$, while the rate of a reaction in the diffusion limit carries the extra exponential term and is scaled by $\exp(-5.8\rho)/(1 - \rho)$. We simulated the metabolic and ribosomal systems based on these two equations, setting $k_{\mathrm{cat}} = 1$ and assuming the metabolic system to be in saturation and the ribosomal system to be in the diffusion limit (S1 Fig).
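The two crowding factors named in this approach are simple enough to tabulate. The sketch below only encodes the factors stated in the text, 1/(1 - ρ) for saturated reactions and exp(-5.8 ρ)/(1 - ρ) for diffusion-limited reactions; the function names are illustrative, and the full rate laws of the original formulation are not reproduced.

```python
import math


def saturation_factor(rho):
    """Crowding factor for a reaction in the saturation limit."""
    return 1.0 / (1.0 - rho)


def diffusion_limit_factor(rho):
    """Crowding factor for a reaction in the diffusion limit."""
    return math.exp(-5.8 * rho) / (1.0 - rho)


for rho in (0.0, 0.1, 0.2, 0.3):
    print(f"rho = {rho:.1f}: saturation {saturation_factor(rho):.3f}, "
          f"diffusion limit {diffusion_limit_factor(rho):.3f}")
```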

Model cell with a metabolic and a ribosomal pathway
To more faithfully represent a complete cell and to study the tradeoff between metabolic and ribosomal reactions, we also consider a balanced growth model with a metabolic sector and a ribosomal sector (Fig 2C). As seen from the results for the pathway models, the two types of reactions have very different optimal conditions: the metabolic sector involves smaller catalysts and substrates than the ribosomal sector, and hence has maximal specific fluxes at a higher cytosolic occupancy.
In this model (Fig 2C), the transporter T imports the external substrate $s_{\mathrm{ext}}$ into the cytosol, where it is labeled $s_1$. The transport reaction is described by ordinary, crowding-unaware irreversible Michaelis-Menten kinetics, with flux
$$v_T = k^T_{\mathrm{cat}} [T] \frac{[s_{\mathrm{ext}}]}{K^T_M + [s_{\mathrm{ext}}]}.$$
We set $k^T_{\mathrm{cat}} = 13.7\,\mathrm{s}^{-1}$ (the median $k_{\mathrm{cat}}$ of enzyme reactions [33]), and $K^T_M = 1\,\mu\mathrm{M}$ (close to the growth-limiting glucose concentration of E. coli [57]).
The metabolic sector comprises an N-step linear pathway of metabolic reactions, identical to the one studied in the simple pathway model: the enzyme of metabolic reaction n ($1 \le n \le N$), denoted as $M_n$, converts substrate $s_n$ into $s_{n+1}$; in reaction n = N, $M_N$ converts one $s_N$ into one precursor p [12]. These N reactions follow crowding-adjusted irreversible Michaelis-Menten kinetics with identical rate parameters $K^{M0}_M = 130\,\mu\mathrm{M}$ and $k^M_{\mathrm{cat}} = 13.7\,\mathrm{s}^{-1}$ (the median $K_M$ and $k_{\mathrm{cat}}$ of enzyme reactions [33]). As before, all N substrates have radius $r_s$ = 0.34 nm, while all N enzymes have radius $r_M$ = 2.4 nm; as before, we assume that the enzyme-substrate complex is spherical and occupies as much volume as one substrate plus one enzyme molecule. To facilitate the numerical solution of the model, we assume that all metabolite concentrations $[s_n]$ and also all enzyme concentrations $[M_n]$ are identical. This corresponds to the optimal balanced growth solution when the differential dilution of intermediate metabolites is ignored [33]; thus, we treat the dilution of intermediate metabolites only approximately here. In the balanced growth condition, the production rate of each substrate is equal to its consumption rate plus its (approximate) rate of dilution through growth, so that its concentration remains stable. We define the total substrate concentration as the sum of the concentrations of all N substrates.
The ribosomal sector comprises the ribosome (R) and the protein precursor (p): R converts p into the N+2 types of protein in the model, namely the N metabolic enzymes ($M_n$), the ribosome (R), and the transporter (T). As before, the radius of p is $r_p$ = 2.4 nm and the radius of R is $r_R$ = 13 nm; their complex is assumed to be spherical and to occupy as much volume as one precursor plus one ribosome molecule.
The ribosomal conversion rate is described by crowding-adjusted irreversible Michaelis-Menten kinetics with parameters $K^{R0}_M = 120\,\mu\mathrm{M}$ [34] and $k^R_{\mathrm{cat}} = 22.0\,\mathrm{s}^{-1}$ [12,34], and the consumption rate of p by R to make proteins is $k_R$. The ribosome converts $l_T$ = 300 precursors into one transporter, $l_M$ = 300 precursors into one metabolic enzyme, and $l_R$ = 7459 precursors into one ribosome [12].
Note that the reactions that produce and consume the precursor p do not conserve volume. The reason is that to model a realistic cell, we envision the precursor as a charged tRNA, only the amino acid part of which is (i) produced by the metabolic pathway and (ii) integrated into the growing protein. The metabolic pathway only provides the amino acid, while the pool of free tRNAs is not explicitly modeled. For this reason, the size of p is substantially larger than the size of the metabolite s N consumed in its production. Conversely, the ribosome consumes 300 precursors to produce a single transporter or enzyme, which are both substantially smaller than the combined volume of the 300 precursors. Here, we envision that the tRNA part of p is set free again and can be re-charged through M N . This treatment assumes that the concentration of free tRNA is so low that we can ignore its dilution through growth and its contribution to the cytosolic crowding.
The corresponding molecules, along with their complexes, are also the crowders that slow down diffusion and disturb Gibbs free energies. The cytosolic occupancy ρ ($0 \le \rho \le 1$) is the total volume of these molecules per unit cytosolic volume, computed from their molar concentrations and molecular volumes with the Avogadro number $N_A$. The growth rate μ of this model cell can be expressed as the flux through the ribosome reaction divided by the total protein concentration. In the balanced growth state, the production of s and p is offset by their consumption and dilution by growth. For a given number of enzymes N and occupancy ρ, we solved this model numerically. As a preliminary step, we used the BARON solver [58], accessed through Pyomo [59,60], and assumed normal crowding-unaware irreversible Michaelis-Menten kinetics, maximizing the growth rate μ over the space of concentrations, subject to the constraints of Eqs (S9) and (S11). In the main model, we assumed θ = 2.3 for both metabolic and ribosomal reactions. As an alternative, we also examined a model with metabolic reactions biased more toward the transition state limit, setting θ = 2.3 for ribosomal reactions and θ = 4.6 for metabolic reactions.
Using the BARON solution as a starting point, we applied the SLSQP algorithm within the function "minimize" in SciPy [61], now using crowding-adjusted Michaelis-Menten kinetics. It is not clear whether the optimization problem has a unique solution; to increase the probability that the solution is a global maximum, we ran the SLSQP algorithm at least 20 times for each (N,ρ) combination, and picked the solution with the highest μ. For each simulation, at least half of the independent runs converged to the same maximal μ.

Estimating the number of enzymes in the metabolic pathway
To obtain a realistic estimate of the number of simultaneously active metabolic reactions in a bacterial cell, we performed flux balance analysis simulations accounting for molecular crowding in terms of a hard limit on the total cellular protein concentration. Simulations were run using "sybil", an R library for efficient constraint-based analyses [36,62,63]. We used sybilccFBA [64,65], a re-implementation of the MOMENT algorithm with an improved treatment of multifunctional enzymes, which maximizes the biomass production rate while constraining the sum of cytosolic metabolic enzyme concentrations at the experimentally observed level.
We used a sybilccFBA implementation of the iAF1260 stoichiometric model [66], parameterized with turnover numbers for E. coli [64,65]. We considered four different nutritional environments. In each case, we counted the number of active metabolic reactions with both substrates and products located in the cytosol, and with a flux > $10^{-6}$ mmol/(gram dry weight)/h to filter out numerical noise; we enumerated the enzymes supporting these reactions, filtering out those with a proteome fraction lower than a cutoff ($10^{-9}$; for reference, the most abundant enzyme has a proteome fraction of ~$10^{-3}$).
The estimated numbers of active metabolic enzymes and reactions in the different conditions are as follows: (i) 259 active enzymes and 349 active reactions in a minimal medium with glucose as the sole carbon source, corresponding to slow to intermediate growth with a cytosol dominated by the metabolic sector [64,67] Table); (ii) 206 active enzymes and 288 active reactions in the same minimal glucose medium supplemented with 20 amino acids, simulating intermediate growth [64,67] Table); (iii) 174 active enzymes and 250 active reactions in a rich medium, corresponding to fast growth and a cytosol dominated by the ribosome and its substrates [64,68] Table); and (iv) 140 active enzymes and 234 active reactions in an extremely rich medium, where all exchange reactions in the iAF1260 E. coli model are allowed to be active.

Conversion of experimental dry mass density to occupancy
An empirical growth law connects the RNA/protein mass ratio (r) of an E. coli cell with its growth rate μ: $r = 0.087 + \mu/(4.5\,\mathrm{h}^{-1})$ (Eq. (1) of Scott et al. [1]); a further experiment reports a subtle deviation of this relationship from linearity (see Fig 1D of Dai et al. [38]). At μ = 0 and μ = 0.7 $\mathrm{h}^{-1}$, r is measured to be 0.086 and 0.225, respectively (S3 Table of Dai et al. [38]). The average specific density of protein is 1.35 g/mL [47], while that of the E. coli 70S ribosome is 1.637 g/mL, where RNA constitutes 61.87% of the mass and the remaining mass fraction is assumed to be protein [69]. From these relationships, we can obtain the RNA density, resulting in a value of 1.81 g/mL.
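The numbers in this paragraph can be reproduced with a short calculation. The sketch below assumes that the ribosome density is the mass-fraction-weighted average of the RNA and protein densities; this mixing rule is an assumption that happens to reproduce the reported 1.81 g/mL, and the function names are illustrative.

```python
def rna_protein_ratio(growth_rate_per_h):
    """Linear growth law r = 0.087 + mu / (4.5 per hour) from the text."""
    return 0.087 + growth_rate_per_h / 4.5


def rna_density(ribosome_density=1.637, rna_mass_fraction=0.6187,
                protein_density=1.35):
    """Solve ribosome = f_RNA * d_RNA + (1 - f_RNA) * d_protein for d_RNA [g/mL]."""
    protein_part = (1.0 - rna_mass_fraction) * protein_density
    return (ribosome_density - protein_part) / rna_mass_fraction


print(f"r at 0.7/h from the linear law: {rna_protein_ratio(0.7):.3f}")
print(f"RNA density: {rna_density():.2f} g/mL")
```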
If we ignore all molecular species in the dry mass except protein and RNA (which constitute ~75% of the dry mass in an E. coli cell [70-72]), we can estimate the overall specific density of dry mass (i.e., mass per volume of dry mass [g/mL], denoted as D) from r using the following formula:
$$D = \frac{1.35\,\mathrm{g/mL} + r \cdot 1.81\,\mathrm{g/mL}}{1 + r}. \quad (S12a)$$
Then ρ can be estimated from D and the cytosolic dry mass density $\rho_{DM}$:
$$\rho = \frac{\rho_{DM}}{D}. \quad (S12b)$$
Applying Eq (S12a), we find that the overall specific density of dry mass D increases from 1.39 g/mL to 1.43 g/mL when μ increases from ~0 $\mathrm{h}^{-1}$ to 0.7 $\mathrm{h}^{-1}$ (r changes from 0.086 to 0.243, reported from Dai et al. [38]). Oldewurtel et al. [10] found that up to these growth rates, the observed cellular dry mass density in E. coli is approximately constant at $\rho_{DM}$ = 0.31 g/mL. Plugging these numbers into Eq (S12b), the empirical ρ decreases from 0.223 to 0.215, a 4% reduction. This shows that the cytosolic volume density ρ changes even when the cytosolic mass density does not change.
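The conversion from dry mass density to occupancy can be written as two small functions. The sketch assumes that D is the mass-weighted average of the protein density (1.35 g/mL) and the RNA density (1.81 g/mL) for an RNA/protein mass ratio r, and that the occupancy is the dry mass density divided by D. These forms reproduce the values quoted in the text to within rounding, and the function names are illustrative.

```python
def dry_mass_specific_density(r, protein_density=1.35, rna_density=1.81):
    """Specific density D [g/mL] of dry mass with RNA/protein mass ratio r."""
    return (protein_density + r * rna_density) / (1.0 + r)


def occupancy(dry_mass_density, r):
    """Volume occupancy from the cytosolic dry mass density [g/mL] and r."""
    return dry_mass_density / dry_mass_specific_density(r)


for label, rho_dm, r in [("slow growth", 0.31, 0.086),
                         ("mu = 0.7/h", 0.31, 0.243),
                         ("mu = 1.2/h", 0.28, 0.33)]:
    print(f"{label}: D = {dry_mass_specific_density(r):.2f} g/mL, "
          f"occupancy = {occupancy(rho_dm, r):.3f}")
```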
The same set of experiments also showed that $\rho_{DM}$ in E. coli decreases from 0.31 g/mL to 0.28 g/mL as μ increases further, from 0.7 $\mathrm{h}^{-1}$ to 1.2 $\mathrm{h}^{-1}$ [10]. Within this range of μ, r increases from 0.225 to 0.33 (r = 0.33 when μ = 1.17 $\mathrm{h}^{-1}$; S3 Table of Dai et al. [38]), and D changes from 1.43 g/mL to 1.46 g/mL (based on r = 0.33) [47,70,73]. Accordingly, the empirical occupancy ρ decreases from 0.215 to 0.191, a decrease of 11%. S8 Fig shows boxplots of the distributions of $\rho_{DM}$ and ρ in different nutritional environments. It shows a trend of roughly constant $\rho_{DM}$ but decreasing ρ with increasing μ at low growth rates. S1A and S1B Table show the tests that compare the distributions of $\rho_{DM}$ and ρ, respectively, between different nutritional conditions.
S1A Table for the results of the tests). Each bar in (B) corresponds to the ρ estimated from the $\rho_{DM}$ measurements of the same condition using Eq. (12a) and Eq. (12b). r, the RNA/protein mass ratio, is necessary in the calculation of ρ; we estimated r from the growth rate μ using the MATLAB interpolation function 'interp1' and the μ-r measurements of wildtype E. coli in S3  S1B Table for the results of the statistical tests). Excluding MM+gly, this graph shows a trend of decreasing ρ with increasing μ. (DOCX)
S1 Table. Pairwise comparisons, based on two-sided Wilcoxon rank-sum tests, of cytosolic mass density $\rho_{DM}$ (A) and cytosolic occupancy ρ (B) of wildtype E. coli (MG1655) cells cultured in different nutritional conditions. ρ is calculated from $\rho_{DM}$ and r, the RNA/protein mass ratio, using Eq. (12a) and Eq. (12b). r is estimated from μ using the MATLAB interpolation function 'interp1' and the μ-r measurements of wildtype E. coli reported in S3 Table of Dai et al. 2016. Each cell in the two tables corresponds to the p-value of the Wilcoxon rank-sum test, colored red (significantly different / dissimilar) or blue (not significantly different / similar) at a significance level of 0.05.
The condition labels are: MM: minimal medium; man: mannose; gly: glycerol; glu: glucose; CAA: casamino acids; RDM: rich defined medium.
S1A Table. P-values of Spearman correlation to compare the cytosolic dry mass density $\rho_{DM}$ between different nutritional conditions.
S1B Table. P-values of Spearman correlation to compare the cytosolic occupancy ρ between different nutritional conditions. (DOCX)
S2